In this paper, three hardware efficient architectures to perform multi-level 2-D discrete wavelet transform (DWT) using lifting (5, 3) and (9, 7) filters are presented. They are classified as folded multi-level architecture (FMA), pipelined multi-level architecture (PMA), and recursive multi-level architecture (RMA). Efficient FMA is proposed using dual-input Z-scan block (B1) with 100% hardware utilization efficiency (HUE). Modular PMA is proposed with the help of block (B1) and dual-input raster scan block (B2) with 60% to 75% HUE. Block B1 and B2 are micro-pipelined to achieve critical path as single adder and single multiplier for lifting (5, 3) and (9, 7) filters, respectively. The clock gating technique is used in PMA to save power and area. Hardware-efficient RMA is proposed with the help of block (B1) and single-input recursive block (B3). Block (B3) uses only single processing element to compute both predict and update; thus, 50% multipliers and adders are saved. Dual-input per clock cycle minimizes total frame computing cycles, latency, and on-chip line buffers. PMA for five-level 2-D wavelet decomposition is synthesized using Xilinx ISE 10.1 for Virtex-5 XC5VLX110T field-programmable gate array (FPGA) target device (Xilinx, Inc., San Jose, CA, USA). The proposed PMA is very much efficient in terms of operating frequency due to pipelining. Moreover, this approach reduces and totals computing cycles significantly as compared to the existing multi-level architectures. RMA for three-level 2-D wavelet decomposition is synthesized using Xilinx ISE 10.1 for Virtex-4 VFX100 FPGA target device.
展开▼
机译:在本文中,提出了使用提升(5,3)和(9,7)滤波器执行多级2-D离散小波变换(DWT)的三种硬件有效架构。它们分为折叠式多层体系结构(FMA),流水线多层体系结构(PMA)和递归多层体系结构(RMA)。提出了使用双输入Z扫描块(B1)和100%硬件利用率(HUE)的高效FMA。借助模块(B1)和具有60%至75%HUE的双输入光栅扫描模块(B2),提出了模块化PMA。对B1和B2块进行微管线处理,以实现关键路径,分别用作提升(5,3)和(9,7)滤波器的单加法器和单乘法器。 PMA使用时钟门控技术以节省功耗和面积。借助块(B1)和单输入递归块(B3)提出了具有硬件效率的RMA。块(B3)仅使用单个处理元素来计算预测和更新;因此,节省了50%的乘法器和加法器。每个时钟周期双输入,最大程度地减少了总的帧计算周期,延迟和片上线路缓冲器。使用用于Virtex-5 XC5VLX110T现场可编程门阵列(FPGA)目标器件(Xilinx,Inc.,San Jose,CA,USA)的Xilinx ISE 10.1合成了用于五级2-D小波分解的PMA。由于流水线操作,建议的PMA在工作频率方面非常高效。此外,与现有的多级体系结构相比,该方法显着减少并总计了计算周期。使用用于Virtex-4 VFX100 FPGA目标器件的Xilinx ISE 10.1合成了用于三级二维小波分解的RMA。
展开▼